NFTAPE: A Framework for Assessing Dependability in Distributed Systems with Lightweight Fault Injectors

نویسندگان

  • David T. Stott
  • Benjamin Floering
  • Daniel Burke
  • Zbigniew Kalbarczyk
  • Ravishankar K. Iyer
چکیده

Many fault injection tools are available for dependability assessment. Although these tools are good at injecting a single fault model into a single system, they suffer from two main limitations for use in distributed systems: (1) no single tool is sufficient for injecting all necessary fault models; (2) it is difficult to port these tools to new systems. NFTAPE, a tool for composing automated fault injection experiments from available lightweight fault injectors, triggers, monitors, and other components, helps to solve these problems. We have conducted experiments using NFTAPE with several types of lightweight fault injectors, including driverbased, debugger-based, target-specific, simulation-based, hardware-based, and performance-fault injections. Two example experiments are described in this paper. The first uses a hardware fault injector with a Myrinet LAN; the other uses a Software Implemented Fault Injection (SWIFI) fault injector to target a space-imaging application.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automated Fault-Inject Based Dependability Analysis of Distributed Computer Systems

Recently, there has been interest in developing a dependability benchmarks for computer systems. This will require a way to inject several different types of faults into many different platforms and a way to collect and compare the results. Analyzing complex heterogeneous distributed systems share the same needs. The current approach to building fault injection tool is inappropriate for these g...

متن کامل

A Runtime Dependability Evaluation Framework for Fault Tolerant Web Services

Service-oriented systems are usually built on top of Web service components, which are distributed across the Internet, making dependability a big challenge. In this paper, we propose a runtime dependability evaluation framework for fault tolerant Web services to attack this crucial problem. We first propose a user-collaborative framework for collecting Web service QoS information from both the...

متن کامل

A Generic Approach to Dependability in Overlay Networks

Overlay networks are virtual communication structures that are logically “laid over” underlying hosting networks such as the Internet. They are implemented by deploying application-level topology maintenance and routing functionality at strategic places in the hosting network [1, 2]. In terms of dependability, most overlays offer proprietary “self-repair” functionality to recover from situation...

متن کامل

An Experimental Evaluation of the Coda

Experimental evaluation is an important way to assess distributed systems, and fault injection is the dominant technique in this area for the evaluation of a system’s dependability. For distributed systems, network failure is an important fault model. Physical network failures often have far-reaching effects, giving rise to multiple correlated failures as seen by higher-level protocols. This th...

متن کامل

Improving Dependability of Embedded Software Systems using Fault Bypass Modeling (FBM)

Fault injection techniques are important and widely used for verifying the dependability of computer systems. Traditionally fault injection has been successfully applied for evaluating dependability of hardware electronics and is now increasingly been used for software systems. At the same time increasing complexity of embedded software systems such as in automotive sector has driven these doma...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000